Multivariate Theory for Analyzing High Dimensional Data
نویسندگان
چکیده
منابع مشابه
Analyzing high-dimensional multispectral data
In this paper, through a series of specific examples, we illustrate some characteristics encountered in analyzing high dimensional multispectral data. The increased importance of the second order statistics in analyzing high dimensional data is illustrated, as is the shortcoming of classifiers such as the minimum distance classifier which rely on first order variations alone. We also illustrate...
متن کاملMethods for regression analysis in high-dimensional data
By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...
متن کاملAnalyzing High-Dimensional Data by Subspace Validity
We are proposing a novel method that makes it possible to analyze high dimensional data with arbitrary shaped projected clusters and high noise levels. At the core of our method lies the idea of subspace validity. We map the data in a way that allows us to test the quality of subspaces using statistical tests. Experimental results, both on synthetic and real data sets, demonstrate the potential...
متن کاملKNN-kernel density-based clustering for high-dimensional multivariate data
Density-based clustering algorithms for multivariate data often have difficulties with high-dimensional data and clusters of very different densities.A new density-based clustering algorithm, called KNNCLUST, is presented in this paper that is able to tackle these situations. It is based on the combination of nonparametric k-nearest-neighbor (KNN) and kernel (KNN-kernel) density estimation. The...
متن کاملMultivariate tests for the evaluation of high-dimensional EEG data.
In this paper several multivariate tests are presented, in particular permutation tests, which can be used in multiple endpoint problems as for example in comparisons of high-dimensional vectors of EEG data. We have investigated the power of these tests using artificial data in simulations and real EEG data. It is obvious that no one multivariate test is uniformly most powerful. The power of th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: JOURNAL OF THE JAPAN STATISTICAL SOCIETY
سال: 2007
ISSN: 1348-6365,1882-2754
DOI: 10.14490/jjss.37.53